Picture for Shuai Yang

Shuai Yang

LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation

Add code
Jun 01, 2026
Viaarxiv icon

WorldCraft: From Camera Navigation to Object Manipulation in Interactive Video World Models

Add code
May 24, 2026
Viaarxiv icon

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Add code
May 19, 2026
Viaarxiv icon

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Add code
Apr 24, 2026
Viaarxiv icon

VibeFlow: Versatile Video Chroma-Lux Editing through Self-Supervised Learning

Add code
Apr 15, 2026
Viaarxiv icon

A Progressive Training Strategy for Vision-Language Models to Counteract Spatio-Temporal Hallucinations in Embodied Reasoning

Add code
Apr 12, 2026
Viaarxiv icon

AgentCollab: A Self-Evaluation-Driven Collaboration Paradigm for Efficient LLM Agents

Add code
Mar 27, 2026
Viaarxiv icon

DVD: Deterministic Video Depth Estimation with Generative Priors

Add code
Mar 12, 2026
Viaarxiv icon

ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI

Add code
Feb 15, 2026
Viaarxiv icon

RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation

Add code
Feb 10, 2026
Viaarxiv icon